skip to main content


Search for: All records

Creators/Authors contains: "Smith, Jennifer A."

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Kutalik, Zoltán (Ed.)

    Epigenetic researchers often evaluate DNA methylation as a potential mediator of the effect of social/environmental exposures on a health outcome. Modern statistical methods for jointly evaluating many mediators have not been widely adopted. We compare seven methods for high-dimensional mediation analysis with continuous outcomes through both diverse simulations and analysis of DNAm data from a large multi-ethnic cohort in the United States, while providing an R package for their seamless implementation and adoption. Among the considered choices, the best-performing methods for detecting active mediators in simulations are the Bayesian sparse linear mixed model (BSLMM) and high-dimensional mediation analysis (HDMA); while the preferred methods for estimating the global mediation effect are high-dimensional linear mediation analysis (HILMA) and principal component mediation analysis (PCMA). We provide guidelines for epigenetic researchers on choosing the best method in practice and offer suggestions for future methodological development.

     
    more » « less
    Free, publicly-accessible full text available November 7, 2024
  2. Abstract

    Mediation hypothesis testing for a large number of mediators is challenging due to the composite structure of the null hypothesis, (: effect of the exposure on the mediator after adjusting for confounders; : effect of the mediator on the outcome after adjusting for exposure and confounders). In this paper, we reviewed three classes of methods for large‐scale one at a time mediation hypothesis testing. These methods are commonly used for continuous outcomes and continuous mediators assuming there is no exposure‐mediator interaction so that the product has a causal interpretation as the indirect effect. The first class of methods ignores the impact of different structures under the composite null hypothesis, namely, (1) ; (2) ; and (3) . The second class of methods weights the reference distribution under each case of the null to form a mixture reference distribution. The third class constructs a composite test statistic using the threepvalues obtained under each case of the null so that the reference distribution of the composite statistic is approximately . In addition to these existing methods, we developed the Sobel‐comp method belonging to the second class, which uses a corrected mixture reference distribution for Sobel's test statistic. We performed extensive simulation studies to compare all six methods belonging to these three classes in terms of the false positive rates (FPRs) under the null hypothesis and the true positive rates under the alternative hypothesis. We found that the second class of methods which uses a mixture reference distribution could best maintain the FPRs at the nominal level under the null hypothesis and had the greatest true positive rates under the alternative hypothesis. We applied all methods to study the mediation mechanism of DNA methylation sites in the pathway from adult socioeconomic status to glycated hemoglobin level using data from the Multi‐Ethnic Study of Atherosclerosis (MESA). We provide guidelines for choosing the optimal mediation hypothesis testing method in practice and develop an R packagemedScanavailable on the CRAN for implementing all the six methods.

     
    more » « less
  3. Abstract Background

    Uncovering the functional relevance underlying verbal declarative memory (VDM) genome-wide association study (GWAS) results may facilitate the development of interventions to reduce age-related memory decline and dementia.

    Methods

    We performed multi-omics and pathway enrichment analyses of paragraph (PAR-dr) and word list (WL-dr) delayed recall GWAS from 29,076 older non-demented individuals of European descent. We assessed the relationship between single-variant associations and expression quantitative trait loci (eQTLs) in 44 tissues and methylation quantitative trait loci (meQTLs) in the hippocampus. We determined the relationship between gene associations and transcript levels in 53 tissues, annotation as immune genes, and regulation by transcription factors (TFs) and microRNAs. To identify significant pathways, gene set enrichment was tested in each cohort and meta-analyzed across cohorts. Analyses of differential expression in brain tissues were conducted for pathway component genes.

    Results

    The single-variant associations of VDM showed significant linkage disequilibrium (LD) with eQTLs across all tissues and meQTLs within the hippocampus. Stronger WL-dr gene associations correlated with reduced expression in four brain tissues, including the hippocampus. More robust PAR-dr and/or WL-dr gene associations were intricately linked with immunity and were influenced by 31 TFs and 2 microRNAs. Six pathways, including type I diabetes, exhibited significant associations with both PAR-dr and WL-dr. These pathways included fifteen MHC genes intricately linked to VDM performance, showing diverse expression patterns based on cognitive status in brain tissues.

    Conclusions

    VDM genetic associations influence expression regulation via eQTLs and meQTLs. The involvement of TFs, microRNAs, MHC genes, and immune-related pathways contributes to VDM performance in older individuals.

     
    more » « less
  4. Graduate students emerging from STEM programs face inequitable professional landscapes in which their ability to practice inclusive and effective science communication with interdisciplinary and public audiences is essential to their success. Yet these students are rarely offered the opportunity to learn and practice inclusive science communication in their graduate programs. Moreover, minoritized students rarely have the opportunity to validate their experiences among peers and develop professional sensibilities through research training. In this article, the authors offer the Science Communication (Sci/Comm) Scholar’s working group at The University of Texas at San Antonio as one model for training graduate students in human dimensions and inclusive science communication for effective public engagement in thesis projects and beyond. The faculty facilitated peer-to-peer working group encouraged participation by women who often face inequities in STEM workplaces. Early results indicate that team-based training in both the science and art of public engagement provides critical exposure to help students understand the methodological care needed for human dimensions research, and to facilitate narrative-based citizen science engagements. The authors demonstrate this through several brief profiles of environmental science graduate students’ thesis projects. Each case emphasizes the importance of research design for public engagement via quantitative surveys and narrative-based science communication interventions. Through a faculty facilitated peer-to-peer working group framework, research design and methodological care function as an integration point for social scientific and rhetorical training for inclusive science communication with diverse audiences. 
    more » « less
  5. Low socioeconomic status (SES) and living in a disadvantaged neighborhood are associated with poor cardiovascular health. Multiple lines of evidence have linked DNA methylation to both cardiovascular risk factors and social disadvantage indicators. However, limited research has investigated the role of DNA methylation in mediating the associations of individual- and neighborhood-level disadvantage with multiple cardiovascular risk factors in large, multi-ethnic, population-based cohorts. We examined whether disadvantage at the individual level (childhood and adult SES) and neighborhood level (summary neighborhood SES as assessed by Census data and social environment as assessed by perceptions of aesthetic quality, safety, and social cohesion) were associated with 11 cardiovascular risk factors including measures of obesity, diabetes, lipids, and hypertension in 1,154 participants from the Multi-Ethnic Study of Atherosclerosis (MESA). For significant associations, we conducted epigenome-wide mediation analysis to identify methylation sites mediating the relationship between individual/neighborhood disadvantage and cardiovascular risk factors using the JT-Comp method that assesses sparse mediation effects under a composite null hypothesis. In models adjusting for age, sex, race/ethnicity, smoking, medication use, and genetic principal components of ancestry, epigenetic mediation was detected for the associations of adult SES with body mass index (BMI), insulin, and high-density lipoprotein cholesterol (HDL-C), as well as for the association between neighborhood socioeconomic disadvantage and HDL-C at FDRq< 0.05. The 410 CpG mediators identified for the SES-BMI association were enriched for CpGs associated with gene expression (expression quantitative trait methylation loci, or eQTMs), and corresponding genes were enriched in antigen processing and presentation pathways. For cardiovascular risk factors other than BMI, most of the epigenetic mediators lost significance after controlling for BMI. However, 43 methylation sites showed evidence of mediating the neighborhood socioeconomic disadvantage and HDL-C association after BMI adjustment. The identified mediators were enriched for eQTMs, and corresponding genes were enriched in inflammatory and apoptotic pathways. Our findings support the hypothesis that DNA methylation acts as a mediator between individual- and neighborhood-level disadvantage and cardiovascular risk factors, and shed light on the potential underlying epigenetic pathways. Future studies are needed to fully elucidate the biological mechanisms that link social disadvantage to poor cardiovascular health.

     
    more » « less
  6. Abstract

    Causal mediation analysis aims to characterize an exposure's effect on an outcome and quantify the indirect effect that acts through a given mediator or a group of mediators of interest. With the increasing availability of measurements on a large number of potential mediators, like the epigenome or the microbiome, new statistical methods are needed to simultaneously accommodate high-dimensional mediators while directly target penalization of the natural indirect effect (NIE) for active mediator identification. Here, we develop two novel prior models for identification of active mediators in high-dimensional mediation analysis through penalizing NIEs in a Bayesian paradigm. Both methods specify a joint prior distribution on the exposure-mediator effect and mediator-outcome effect with either (a) a four-component Gaussian mixture prior or (b) a product threshold Gaussian prior. By jointly modelling the two parameters that contribute to the NIE, the proposed methods enable penalization on their product in a targeted way. Resultant inference can take into account the four-component composite structure underlying the NIE. We show through simulations that the proposed methods improve both selection and estimation accuracy compared to other competing methods. We applied our methods for an in-depth analysis of two ongoing epidemiologic studies: the Multi-Ethnic Study of Atherosclerosis (MESA) and the LIFECODES birth cohort. The identified active mediators in both studies reveal important biological pathways for understanding disease mechanisms.

     
    more » « less
  7. We consider Bayesian high‐dimensional mediation analysis to identify among a large set of correlated potential mediators the active ones that mediate the effect from an exposure variable to an outcome of interest. Correlations among mediators are commonly observed in modern data analysis; examples include the activated voxels within connected regions in brain image data, regulatory signals driven by gene networks in genome data, and correlated exposure data from the same source. When correlations are present among active mediators, mediation analysis that fails to account for such correlation can be suboptimal and may lead to a loss of power in identifying active mediators. Building upon a recent high‐dimensional mediation analysis framework, we propose two Bayesian hierarchical models, one with a Gaussian mixture prior that enables correlated mediator selection and the other with a Potts mixture prior that accounts for the correlation among active mediators in mediation analysis. We develop efficient sampling algorithms for both methods. Various simulations demonstrate that our methods enable effective identification of correlated active mediators, which could be missed by using existing methods that assume prior independence among active mediators. The proposed methods are applied to the LIFECODES birth cohort and the Multi‐Ethnic Study of Atherosclerosis (MESA) and identified new active mediators with important biological implications.

     
    more » « less
  8. Abstract

    Causal mediation analysis aims to examine the role of a mediator or a group of mediators that lie in the pathway between an exposure and an outcome. Recent biomedical studies often involve a large number of potential mediators based on high‐throughput technologies. Most of the current analytic methods focus on settings with one or a moderate number of potential mediators. With the expanding growth of ‐omics data, joint analysis of molecular‐level genomics data with epidemiological data through mediation analysis is becoming more common. However, such joint analysis requires methods that can simultaneously accommodate high‐dimensional mediators and that are currently lacking. To address this problem, we develop a Bayesian inference method using continuous shrinkage priors to extend previous causal mediation analysis techniques to a high‐dimensional setting. Simulations demonstrate that our method improves the power of global mediation analysis compared to simpler alternatives and has decent performance to identify true nonnull contributions to the mediation effects of the pathway. The Bayesian method also helps us to understand the structure of the composite null cases for inactive mediators in the pathway. We applied our method to Multi‐Ethnic Study of Atherosclerosis and identified DNA methylation regions that may actively mediate the effect of socioeconomic status on cardiometabolic outcomes.

     
    more » « less